Search CORE

6 research outputs found

Shift-Reduce CCG Parsing with a Dependency Model

Author: Stephen Clark
Wenduan Xu
Yue Zhang
Publication venue: ACL
Publication date: 01/01/2014
Field of study

This paper presents the first dependency model for a shift-reduce CCG parser. Modelling dependencies is desirable for a number of reasons, including handling the “spurious ” ambiguity of CCG; fitting well with the theory of CCG; and optimizing for structures which are evaluated at test time. We develop a novel training technique using a dependency oracle, in which all derivations are hidden. A challenge arises from the fact that the oracle needs to keep track of exponentially many goldstandard derivations, which is solved by integrating a packed parse forest with the beam-search decoder. Standard CCGBank tests show the model achieves up to 1.05 labeled F-score improvements over three existing, competitive CCG parsing models

CiteSeerX

Crossref

Recommended from our members

Structured Learning with Inexact Search: Advances in Shift-Reduce CCG Parsing

Author: Xu Wenduan
Publication venue: University of Cambridge
Publication date: 07/12/2017
Field of study

Statistical shift-reduce parsing involves the interplay of representation learning, structured learning, and inexact search. This dissertation considers approaches that tightly integrate these three elements and explores three novel models for shift-reduce CCG parsing. First, I develop a dependency model, in which the selection of shift-reduce action sequences producing a dependency structure is treated as a hidden variable; the key components of the model are a dependency oracle and a learning algorithm that integrates the dependency oracle, the structured perceptron, and beam search. Second, I present expected F-measure training and show how to derive a globally normalized RNN model, in which beam search is naturally incorporated and used in conjunction with the objective to learn shift-reduce action sequences optimized for the final evaluation metric. Finally, I describe an LSTM model that is able to construct parser state representations incrementally by following the shift-reduce syntactic derivation process; I show expected F-measure training, which is agnostic to the underlying neural network, can be applied in this setting to obtain globally normalized greedy and beam-search LSTM shift-reduce parsers.The Carnegie Trust for the Universities of Scotland; The Cambridge Trus

Apollo (Cambridge)

Don’t Interrupt Me While I Type: Inferring Text Entered Through Gesture Typing on Android Keyboards

Author: Anderson Ross
Simon Laurent
Xu Wenduan
Publication venue: Proceedings on Privacy Enhancing Technologies
Publication date: 01/01/2016
Field of study

We present a new side-channel attack against soft keyboards that support gesture typing on Android smartphones. An application without any special permissions can observe the number and timing of the screen hardware interrupts and system-wide software interrupts generated during user input, and analyze this information to make inferences about the text being entered by the user. System-wide information is usually considered less sensitive than app-specific information, but we provide concrete evidence that this may be mistaken. Our attack applies to all Android versions, including Android M where the SELinux policy is tightened. We present a novel application of a recurrent neural network as our classifier to infer text. We evaluate our attack against the “Google Keyboard” on Nexus 5 phones and use a real-world chat corpus in all our experiments. Our evaluation considers two scenarios. First, we demonstrate that we can correctly detect a set of pre-defined “sentences of interest” (with at least 6 words) with 70% recall and 60% precision. Second, we identify the authors of a set of anonymous messages posted on a messaging board. We find that even if the messages contain the same number of words, we correctly re-identify the author more than 97% of the time for a set of up to 35 sentences. Our study demonstrates a new way in which system-wide resources can be a threat to user privacy. We investigate the effect of rate limiting as a countermeasure but find that determining a proper rate is error-prone and fails in subtle cases. We conclude that real-time interrupt information should be made inaccessible, perhaps via a tighter SELinux policy in the next Android version.This work was partially supported by the Samsung Electronics Research Institute (SERI), Thales, and the Carnegie Trust for the Universities of Scotland

Edinburgh Research Explorer

Apollo (Cambridge)

MicroRNA expression in esophageal squamous cell carcinoma: Novel diagnostic and prognostic biomarkers

Author: Altman
Barrett
Baxter
Benjamini
Bolstad
Chao Feng
Chen
Chen
Donglin Wang
Dweep
Enright
Enzinger
Gholamin
Guo
Haixin Yu
Hastie
Hiroki
Hsu
Jiang
Jinnan Zhang
Kano
Kertesz
Krek
Lee
Lee
Lewis
Lin
Lin
Liu
Ma
Ma
Mandard
Masoudi-Nejad
Ni
Nohata
Robin
Schriml
Shannon
Shengtao Shang
Sheyhidin
Smyth
Stahl
Swingler
Therneau
Wang
Ward
Wei Zhao
Wenduan Ma
Wu
Wu
Xiao
Xu
Yan Wang
Yang
Zhang
Zhang
Zhao
Znaor
Publication venue: 'Spandidos Publications'
Publication date
Field of study

Crossref